Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WIIfCH WERE INSERTED DY PTO SOFTWARE 



.Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was retrieved in a word processor a ftcr creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

_In valid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

_Misaligncd Amino The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 



Non-ASCII 



_Variable Length 



_PatentIn 2.0 
"bug" 



The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 

Sequences) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patcntln version -2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences^) . Normally, Patenlln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



. missing. If intentional, please insert the following lines for each slapped sequence: 



_Skippcd Sequences Sequences) 

(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO wherc~"X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 



(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ED NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

missing. If intentional, please insert the following lines for each skipped sequence. 



_Skippcd Sequences Sequences) 
(NEW RULES) <2 1 0> sequence id number 
<400> sequence id number 
000 

_Usc of n's or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. 

(NEW RULES) Per 1.823 of Sequence Rules , use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



10 



_Invalid <213> 
Response 

Uscof<220> 



Per 1.823 of Sequence Rules, the only valid <2I3> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <2 13> response is Unknown or 
is Artificial Sequence 



Sequences) 



_ missing the <220> "Feature" and associated numeric identifiers and responses. 



_PatentIn 2.0 
"bug" 



Use of <220> to <223> is MANDATORY if <21 3> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/0 1/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patenlln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



AMC - Biotechnology Systems Branch - 06704/2001 



ti 



OIPE 



PAGE: 1 RAW SEQUENCE LISTING DATE: 11/10/2001 

PATENT APPLICATION US/09/849, 980A TIME: 01:51:11 

INPUT SET: S36663.raw 



This Raw Listing contains the General DOOS Not Comolv 
Information Section and those Sequences Qqj rected Diskette M a 
containing ERRORS. 6TO Nee <tecf 



1 SEQUENCE LISTING 
2 

3 (1) General Information 
4 

5 (i) APPLICANT: SANTEN PHARMACEUTICAL CO., LTD. 

6 (ii) TITLE OF INVENTION: Novel Polypeptide Having Water Channel 

7 Activity and DNA sequence 

8 (iii) NUMBER OF SEQUENCES: 2 

9 (iv) CORRESPONDENCE ADDRESS: 

10. (A) ADDRESSEE: SANTEN PHARMACEUTICAL CO., LTD. 

11 (B) STREET: 9 19 Shimoshinjo 3-chome Higashiyodogawa-Ku 

12 (C) CITY: Osaka 

13 (D) STATE: Osaka 

14 (E) COUNTRY: JAPAN 

15 (F) ZIP: 533-0021 

16 (v) COMPUTER READABLE FORM: 

17 (A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 MB, storage 

18 (B) COMPUTER: IBM PS/2 or compatibles 

19 (C) OPERATING SYSTEM: WINDOWS 95/97 

20 (D) SOFTWARE: Microsoft Word 97 

21 (vi) CURRENT APPLICATION DATE: 

--> 22 (A) APPLICATION NUMBER: 09/381 , 810 

23 (B) FILING DATE: 19 -OCT- 1999 

--> 24 (C) CLASSIFICATION: 435 

25 (vii) PRIOR APPLICATION DATA 

26 (A) APPLICATION NUMBER*: PCT/ JP98/0 13 71 

27 (B) FILING DATE: 27-MAR-1998 

28 (C) APPLICATION NUMBER: JP 09-094845 

29 (D) FILILNG DATE: 28 -MAR- 1997 

3 0 (viii) ATTORNEY/AGENT INFORMATION: 

31 (A) NAME: Burton A. Amernick 

32 (B) REGISTRATION NUMBER: 24852 

33 (C) REFERENCE/DOCKET NUMBER: 1581/00156 

34 (ix) TELECOMMUNICATION INFORMATION: 

35 (A) TELEPHONE: (202)331-7111 

36 (B) FAX: (202)293-6229 
37 



ERRORED SEQUENCES FOLLOW: 



38 (2) INFORMATION FOR SEQ ID NO : 1: 
39 



PAGE: 2 RAW SEQUENCE LISTING DATE: 1 1/10/2001 

PATENT APPLICATION US/09/849,980A TIME: 01:51:12 

INPUT SET: S36663.raw 

40 (i) SEQUENCR^C^RACTERISTICS : "l/HJaliAl 
--> 41 (A) LENGTH (F3 4 £>amino acids ' j. I|C * r* 

42 (b) type: a^T acid J> OevmoT r 

--> 43 (D) TOPOLOGY© linear 

44 (ii) MOLECULE'" TYPE : F peptide 
45 

--> 46 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

47 

48 Met Val Gin Ala Ser Gly His Arg Arg Ser Thr Arg Gly Ser Lys Met 

49 5 10 15 

50 Val Ser Trp Ser Val lie Ala Lys lie Gin Glu lie Leu Gin Arg Lys 

51 20 25 30 

52 Met Val Arg Glu Phe Leu Ala Glu Phe Met Ser Thr Tyr Val Met Met 

53 35 40 45 

54 val Phe Gly Leu Gly Ser Val Ala His Met Val Leu Asn Lys Lys Tyr 

55 50 55 60 

56 Gly Ser Tyr Leu Gly Val Asn Leu Gly Phe Gly Phe Gly Val Thr Met 

57 65 70 75 80 

58 Gly Val His Val Ala Gly Arg lie Ser Gly Ala His Met Asn Ala Ala 

59 85 90 95 

60 Val Thr Phe Ala Asn Cys Ala Leu Gly Arg Val Pro Trp Arg Lys Phe 

61 100 105 110 

62 Pro Val Tyr Val Leu Gly Gin Phe Leu Gly Ser Phe Leu Ala Ala Ala 

63 115 120 125 

64 Thr lie Tyr Ser Leu Phe Tyr Thr Ala lie Leu His Phe Ser Gly Gly 

65 130 135 140 

66 Gin Leu Met Val Thr Gly Pro Val Ala Thr Ala Gly lie Phe Ala Thr 

67 145 150 155 160 

68 Tyr Leu Pro Asp His Met Thr Leu Trp Arg Gly Phe Leu Asn Glu Ala 

69 165 170 175 

7 0 Trp Leu Thr Gly Met Leu Gin Leu Cys Leu Phe Ala Thr Thr Asp Gin 

71 180 185 190 

72 Glu Asn Asn Pro Ala Leu Pro Gly Thr Glu Ala Leu Val lie Gly lie 

73 195 200 205 

74 Leu Val Val lie lie Gly Val Ser Leu Gly Met Asn Thr Gly Tyr Ala 

75 210 215 220 

76 He Asn Pro Ser Arg Asp Leu Pro Pro Arg He Phe Thr Phe He Ala 

77 225 230 235 240 

78 Gly Trp Gly Lys Gin Val Phe Ser Asn Gly Glu Asn Trp Trp Trp Val 

79 245 250 255 

8 0 Pro Val Val Ala Pro Leu Leu Gly Ala Tyr Leu Gly Gly He He Tyr 

81 260 265 270 

82 Leu Val Phe He Gly Ser Thr He Pro Arg Glu Pro Leu Lys Leu Glu 

83 275 280 285 

84 Asp Ser Val Ala Tyr Glu Asp His Gly He Thr Val Leu Pro Lys Met 

85 290 295 300 

86 Gly Ser His Glu Pro Thr He Ser Pro Leu Thr Pro Val Ser Val Ser 

87 305 310 315 320 

88 Pro Ala Asn Arg Ser Ser Val His Pro Ala Pro Pro Leu His Glu Ser 

89 325 330 335 

90 Met Ala Leu Glu His Phe 

91 340 
92 



PAGE: 3 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/849, 980A 



DATE: 11/10/2001 
TIME: 01:51:12 



INPUT SET: S3 666 3. raw 



93 (2) INFORMATION FOR. SEQ ID NO : 2 : 

94 (i) SEQUENCE^LARACTERISTICS : 

--> 95 (A) LENGTI^Jl258^>ase pairs o^t^ 

96 (B) TYPE: nucleic acid p <>®^ 

97 {C) STRANDEDNESS: double / 
--> 98 (D) TOPOLOGY jF_jXnear 

9 9 (ii) MOLECULE TYPE: CDNA to mRNA 

100 (vi) ORIGINAL SOURCE: 

101 (A) ORGANISM: Homo sapiens 

102 (B) TISSUE TYPE: fat tissue 

103 (ix) FEATURE: 

104 (A) NAME/KEY: exon 

105 (B) LOCATION: F173 .. 1198 

106 (C) IDENTIFICATION METHOD: by experiment 

107 (C) IDENTIFICATION METHOD: by experiment 

--> 108 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: * 

110 GGCTCTGGAC TGGGGACACA GGGATAGCTG AGCCCCAGCT ) (jJ^ 5 ^ ± . y . 

--> 111 GGGGGTGGAA GCTGAGCCAG 60 ( < SvUUU /)t&^ X 

112 GGACAGTCAC GGAGGAACAA GATCAAGATG CGCTGTAACT > 

113 GAGAAGCCCC CAAGGCGGAG 120 \ 

114 GCTGAGAATC AGAGACATTT CAGCAGACAT CTACAAATCT ) 
-> 115 GAAAGACAAA AC ATG GTT 178 ^ 

116 Met Val 

117 ! 

118 CAA GCA TCC GGG CAC AGG CGG TCC ACC CGT GGC TCC AAA ^ 
-> 119 ATG GTC TCC 226 \ 

12 0 Gin Ala Ser Gly His Arg Arg Ser Thr Arg Gly Ser Lys Met Val Ser \ 

121 5 10 15 

122 TGG TCC GTG ATA GCA AAG ATC CAG GAA ATA CTG CAG AGG / 
-> 123 AAG ATG GTG 274 / 

124 Trp Ser Val He Ala Lys He Gin Glu He Leu Gin Arg Lys Met Val / 

125 20 25 30 / 

126 CGA GAG TTC CTG GCC GAG TTC ATG AGC ACA TAT GTC ATG / 
-> 127 ATG GTA TTC 322 / 

128 Arg Glu Phe Leu Ala Glu Phe Met Ser Thr Tyr Val Met Met Val Phe 

129 35 40 45 50 \ r 

130 GGC CTT GGT TCC GTG GCC CAT ATG GTT CTA AAT AAA AAA \ ^-O/V*"^ 

-> 131 TAT GGG AGC 370 \ ^ ^>ty 

132 Gly Leu Gly Ser Val Ala His Met Val Leu Asn Lys Lys Tyr Gly Ser ) 

133 55 60 65 / 

134 TAC CTT GGT GTC AAC TTG GGT TTT GGC TTC GGA GTC ACC / 
-> 135 ATG GGA GTG 418 I 

13 6 Tyr Leu Gly Val Asn Leu Gly Phe Gly Phe Gly Val Thr Met Gly Val \ 
137 70 75 80 \ 

13 8 CAC GTG GCA GGC CGC ATC TCT GGA GCC CAC ATG AAC GCA \ 
-> 139 GCT GTG ACC 466 \ 

14 0 His Val Ala Gly Arg He Ser Gly Ala His Met Asn Ala Ala Val Thr 

141 85 90 95 / 

142 TTT GCT AAC TGT GCG CTG GGC CGC GTG CCC TGG AGG AAG / 
-> 143 TTT CCG GTC 514 / 

144 Phe Ala Asn Cys Ala Leu Gly Arg Val Pro Trp Arg Lys Phe Pro Val / 



PAGE: 4 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/849, 980 A 



DATE: 11/10/2001 
TIME: 01:51:13 



- - > 



- -> 



- - > 



- -> 



- - > 



--> 



- - > 



145 

146 

147 

148 

149 

150 

151 

152 

153 

154 

155 

156 

157 

158 

159 

160 

161 

162 

163 

164 

165 

166 

167 

168 

169 

170 

171 

172 

173 

174 

175 

176 

177 

178 

179 

180 

181 

182 

183 

184 

185 

186 

187 

188 

189 

190 

191 

192 

193 

194 

195 

196 

197 



TAT 
GCC 
Tyr 
115 
TAC 
GGA 
Tyr 



100 
GTG 
ACC 

val 

AGT 
CAG 
Ser 



ATG GTG 
ACC TAC 
Met Val 

CCT GAT 
GCG TGG 
Pro Asp 



ACC 
CAG 
Thr 

AAC 
GGC 
Asn 
195 
GTC 
GCC 
Val 



GGG 
GAG 
Gly 
180 
CCA 
ATC 
Pro 

ATC 
ATC 
He 



CCG TCC 
GCT GGT 
Pro Ser 

GGC AAA 
TGG GTG 
Gly Lys 



GTG 
TAC 

Val 

TTC 
GAG 
Phe 
275 
GTG 
ATG 
Val 



GCA 
CTG 
Ala 
260 
ATT 
GAT 
He 

GCG 
GGA 
Ala 



CAT GAA 
AGC CCT 
His Glu 



INPUT SET: S36663.raw 

105 HO 
CTG GGG CAG TTC CTG GGC TCC TTC CTG GCG GCT 
ATC 562 

Leu Gly Gin Phe Leu Gly Ser Phe Leu Ala Ala Ala Thr He 
120 125 130 

CTC TTC TAC ACG GCC ATT CTC CAC TTT TCG GGT 
CTG 610 

Leu Phe Tyr Thr Ala He Leu His Phe Ser Gly Gly Gin Leu 

135 140 . 145 

ACC GGT CCC GTC GCT ACA GCT GGC ATT TTT GCC 
CTT 658 

Thr Gly Pro Val Ala Thr Ala Gly He Phe Ala Thr Tyr Leu 

150 155 160 

CAC ATG ACA TTG TGG CGG GGC TTC CTG AAT GAG 
CTG 706 

His Met Thr Leu Trp Arg Gly Phe Leu Asn Glu Ala Trp Leu 
165 170 175 

ATG CTC CAG CTG TGT CTC TTC GCC ATC ACG GAC 
AAC 754 

Met Leu Gin Leu Cys Leu Phe Ala Thr Thr Asp Gin Glu Asn 

185 190 
GCA CTG CCA GGA ACA GAG GCG CTG GTG ATA 
CTC GTG 802 

Ala Leu Pro Gly Thr Glu Ala Leu Val He Gly He Leu Val 
200 205 210 

ATC GGG GTG TCC CTT GGC ATG AAC ACA GGA TAT 
AAC 850 

lie Gly Val Ser Leu Gly Met Asn Thr Gly Tyr Ala He Asn 

215 220 225 

CGG GAC CTG CCC CCC CGC ATC TTC ACC TTC ATT 
TGG 898 

Arg Asp Leu Pro Pro Arg He Phe Thr Phe He Ala Gly Trp 

230 235 240 

CAG GTC TTC AGC AAT GGG GAG AAC TGG TGG 
CCA GTG 946 

Gin Val Phe Ser Asn Gly Glu Asn Trp Trp Trp Val Pro Val 
245 250 255 

CCA CTT CTG GGT GCC TAT CTA GGT GGC ATC ATC 
GTC 994 

Pro Leu Leu Gly Ala Tyr Leu Gly Gly He He Tyr Leu Val 

265 270 
GGC TCC ACC ATC CCA CGG GAG CCC CTG AAA TTG 
TCT 1042 

Gly Ser Thr He Pro Arg Glu Pro Leu Lys Leu Glu Asp Ser 
280 285 290 

TAT GAA GAC CAC GGG ATA ACC GTA TTG CCC AAG 
TCT 1090 

Tyr Glu Asp His Gly He Thr Val Leu Pro Lys Met Gly Ser 

295 300 305 

CCC ACG ATC TCT CCC CTC ACC CCC GTC TCT GTG 
GCC 1138 

Pro Thr He Ser Pro Leu Thr Pro Val Ser Val Ser Pro Ala 
310 315 320 




PAGE: 5 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/849, 980A 



DATE: 11/10/2001 
TIME: 01:51:13 



INPUT SET: S3 666 3. raw 

198 AAC AGA TCT TCA GTC CAC CCT GCC CCA CCC TTA CAT GAA 

199 TCC ATG GCC 1186 

200 Asn Arg Ser Ser Val His Pro Ala Pro Pro Leu His Glu Ser Met Ala 

201 325 330 335 V ^ ^ 

202 CTA GAG CAC TTC TAAGCAGAGA TTATTTGTGA TCCCATCCAT >f9 Yr ^-" 

203 TCCCCAATAA 1238 1 

204 Leu Glu His Phe 

205 340 

206 AGCAAGGCTT GTCCGACAAA " 1258 
207 



PAGE: 



1 



SEQUENCE VERIFICATION REPORT 
PATENT APPLICATION US/09/849, 980A 



DATE: 11/10/2001 
TIME: 01:51:13 



INPUT SET: S36663.raw 



Line 

22 

24 

41 

41 

43 

46 

95 

95 

98 

108 

111 

113 

115 

119 

123 

127 

131 

135 

139 

143 

147 

151 

155 

159 

163 

167 

171 

175 

179 

183 

187 

191 

195 

199 

203 



Error 

Wrong application Serial Number 

Wrong Classification 

Length must be an Integer 

Entered (0) and Calc. Seq. Length (342) differ 

Wrong or Missing Sequence Topology 

Wrong or Missing Sequence Topology 

Length must be an Integer 

Entered (0) and Calc. Seq. Length (283) differ 

Wrong or Missing Sequence Topology 

Wrong or Missing Sequence Topology 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 

# of Sequences for line conflicts w/ running total 



Original Text 

(A) APPLICATION NUMBER: 09/381,810 

(C) CLASSIFICATION: 435 
(A) LENGTH F342 amino acids 
(A) LENGTH F342 amino acids 

(D) TOPOLOGYF linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
(A) LENGTHF1258 base pairs 
(A) LENGTHF1258 base pairs 
(D) TOPOLOGYF linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGGGGTGGAA GCTGAGCCAG 60 

GAGAAGCCCC CAAGGCGGAG 120 

GAAAGACAAA AC ATG GTT 178 

ATG GTC TCC 226 

AAG ATG GTG 274 

ATG GTA TTC 322 

TAT GGG AGC 370 

ATGGGAGTG 418 

GCT GTG ACC 466 

TTTCCGGTC 514 

GCC ACC ATC 562 

GGA CAG CTG 610 

ACC TAC CTT 658 

GCG TGG CTG 706 

CAG GAG AAC 754 

GGC ATC CTC GTG 802 

GCC ATC AAC 850 

GCT GGT TGG 898 

TGG GTG CCA GTG 946 

TAC CTG GTC 994 

GAGGATTCT 1042 

ATGGGATCT 1090 

AGCCCTGCC 1138 

TCC ATG GCC 1186 

TCCCCAATAA 1238 



SEQUENCE CORRECTION REPORT 
PATENT APPLICATION US/09/849, 980A 



DATE: 11/10/2001 
TIME: 01:51:13 



INPUT SET: S36663.raw 



Original Text 

(1) General Information 

(vi) CURRENT APPLICATION DATE: 

(vii) PRIOR APPLICATION DATA 
(D) FILILNG DATE: 28-MAR-1997 
(B) FAX: (202)293-6229 

(ii) MOLECULE TYPEF peptide 

(2) INFORMATION FOR SEQ ID NoF2 
(B) LOCATIONF173..1198 



Corrected Text 

(1) GENERAL INFORMATION: 

(vi) CURRENT APPLICATION DATA: 

(vii) PRIOR APPLICATION DATA: 
(D) FILING DATE: 28-MAR-1997 
(B) TELEFAX: (202)293-6229 

(ii) MOLECULE TYPE:F peptide 

(2) INFORMATION FOR SEQ ID NO:2: 
(B) LOCATION:F173..1198 



